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(4)  Introduction 


Treatment  of  breast  cancer  at  an  early  stage  can  significantly  improve  the  survival  rate  of 
patients.  Mammography  is  currently  the  most  sensitive  method  for  detecting  early  breast  cancer, 
and  it  is  also  the  most  practical  for  screening.  Although  general  rules  for  differentiation  between 
malignant  and  benign  lesions  exist,  in  clinical  practice,  approximately  only  15-30%  of  cases 
referred  to  surgical  biopsy  are  actually  malignant.  A  number  of  research  groups  are  in  the 
process  of  developing  computer-aided  diagnosis  (CAD)  methods  which  can  provide  a  consistent 
and  reproducible  second  opinion  to  the  radiologist  for  the  detection  and  classification  of  breast 
abnormalities. 

Radiologists  routinely  compare  mammograms  from  a  current  examination  with  those 
obtained  in  previous  years,  if  available,  for  identifying  interval  changes,  detecting  potential 
abnormalities,  and  in  evaluating  breast  lesions.  It  is  widely  accepted  that  interval  changes  in 
mammographic  features  are  very  useful  for  both  detection  and  classification  of  abnormalities. 
However,  CAD  techniques  that  use  multiple  exams  for  detection  or  characterization  have  not 
been  commonly  explored,  probably  because  of  the  difficulty  in  the  registration  of  the  compressed 
breast  images  from  different  exams.  We  have  been  investigating  methods  for  analysis  of 
temporal  changes  of  masses  on  mammograms  to  improve  detection  and  classification.  To  our 
knowledge,  there  is  no  existing  CAD  technique  for  registration  of  microcalcification  clusters  or 
classification  of  microcalcifications  based  on  temporal  change  information. 

The  extraction  of  any  meaningful  information  from  a  prior  mammogram  first  requires  a 
common  frame  of  reference  between  the  current  and  prior  mammograms.  Several  complicating 
factors,  such  as  breast  compression  difference  between  current  and  prior  mammograms,  energy 
difference  between  the  two  imaging  conditions,  differences  in  screen  film  properties  and  film 
processing  conditions,  and  potential  changes  in  breast  structures  between  the  two  images  with 
patient  age,  make  it  difficult  to  obtain  such  a  frame  of  reference.  On  breast  images,  there  are  no 
invariant  landmarks  (except  for  the  nipple)  that  can  serve  as  control  points  in  conventional  image 
registration  methods  to  register  the  two  mammograms.  In  this  project,  we  propose  to  develop  an 
innovative  regional  registration  method  that  does  not  depend  on  specific  control  points.  We  will 
first  approximately  align  the  current  and  prior  mammogram  based  on  maximization  of  mutual 
information.  Next,  we  will  design  a  novel  approach  in  which  the  computer  emulates  the 
radiologists’  search  method  in  finding  corresponding  lesions  on  mammograms.  Automated 
search  of  microcalcification  cluster  within  the  search  region  on  the  prior  mammogram  will  be 
performed.  Our  current  automated  microcalcification  detection  algorithm  will  provide  a  basis 
for  this  search.  However,  since  the  detection  is  limited  to  the  small  search  region,  the  detection 
can  be  performed  in  high  resolution  and  the  algorithm  parameters  can  be  adjusted  to  improve  the 
detection  sensitivity  of  the  very  subtle  clusters  on  the  prior  mammograms  without  excessive 
trade-off  in  increasing  false-positives.  A  correspondence  classifier  will  be  developed  to  identify 
the  matched  pair  of  clusters  on  the  two  mammograms.  The  image  features  of  the  corresponding 
microcalcification  clusters  can  then  be  automatically  extracted  and  feature  measures 
characterizing  interval  changes  derived.  A  classification  scheme  to  differentiate  malignant  and 
benign  clusters  using  the  interval  change  information  will  be  developed.  This  computerized 
interval  change  analysis  will  be  an  important  component  of  a  CAD  system  for  mammographic 
interpretation. 

This  project  aims  at  developing  a  novel  interval  change  analysis  scheme  to  improve  the 
accuracy  of  CAD.  We  will  investigate  the  problem  of  classifying  microcalcifications  as 
malignant  or  benign  based  on  temporal  changes  in  mammographic  features  using  a  combination 
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of  computer  vision,  automated  feature  extraetion,  statistieal  elassifieation,  and  artifieial 
intelligenee  teehniques.  We  hypothesize  that  the  use  of  temporal  information  would  improve  the 
ability  of  CAD  to  offer  an  aeeurate  and  objeetive  seeond  opinion  to  radiologists  whieh,  in  turn, 
would  inerease  the  positive  predietive  value  of  mammography,  reduee  the  number  of  benign 
biopsies,  and  henee  reduee  both  eost  and  patient  morbidity.  If  integrated  in  a  eomplete  CAD 
system,  the  algorithms  to  be  developed  in  this  projeet  may  also  inerease  the  effieacy  of 
mammography  for  early  deteetion  of  breast  eaneer. 
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(5)  Body 


In  the  fourth  year  (7/1/05-6/30/06)  of  this  grant,  we  have  performed  the  following  studies: 

(A)  Database  collection  of  malignant  and  benign  breast  microcalcification  cases  that  have 
multiple  examinations  (Task  1) 

We  eontinued  eollecting  the  data  set  for  this  study  from  the  files  of  patients  who  had 
undergone  biopsy  at  the  University  of  Miehigan.  The  mammograms  are  seanned  and  the  images  are 
saved  in  our  storage  deviee  using  automated  graphie  user  interface  developed  in  our  laboratory. 
Additionally  the  film  information  is  recorded  in  a  Microsoft  Access  database.  Temporal  pairs  of 
images  were  obtained.  The  current  mammogram  of  each  temporal  pair  exhibited  a  biopsy-proven 
mass.  We  scan  both  cranio-caudal  (CC)  and  mediolateral-oblique  (MLO)  views.  The 
mammograms  were  digitized  with  a  LUMISCAN  85  laser  scanner  at  a  pixel  resolution  of  0.05  mm 
X  0.05  mm  and  with  12-bit  resolution. 

While  the  regional  registration  technique  can  be  used  for  determining  a  corresponding 
structure  or  region  for  any  structure  (both  normal  tissues  and  masses)  in  the  breast,  in  this  study  we 
are  analyzing  its  accuracy  on  biopsy-proven  masses  alone.  The  location  of  the  mass  on  the  current 
mammogram  is  identified  by  an  Mammography  Quality  Standards  Act  (MQSA)-approved 
radiologist  experienced  in  breast  imaging  using  an  interactive  image  analysis  tool  on  a  UNIX 
workstation.  To  provide  the  ground  truth  for  evaluation  of  the  computerized  method,  the 
radiologist  manually  identifies  the  corresponding  region  on  the  prior  mammogram.  Bounding 
polygons  enclosing  the  microcalcification  cluster  on  the  current  mammogram  and  the 
corresponding  object  on  the  prior  mammogram  are  provided  by  the  radiologist  for  each  case.  Each 
microcalcification  cluster  as  well  as  the  corresponding  structure  on  the  prior  mammogram  are  rated 
for  its  visibility  on  a  scale  of  1  to  10,  where  the  rating  of  1  corresponded  to  the  most  visible 
category.  The  size  of  the  microcalcification  cluster  on  the  current  mammogram  as  well  as  the  size 
of  the  corresponding  structure  on  the  prior  mammogram  are  also  measured  by  the  radiologist.  The 
parenchymal  density  is  rated  based  on  the  Breast  Imaging  Reporting  and  Data  System  (BI-RADS) 
lexicon. 


(B)  Development  of  feature  measures  and  temporal  classifier  for  characterization  of 
temporal  changes  in  microcalcification  clusters,  (Task  6) 

In  the  past  year  of  the  project  we  made  a  major  progress  in  the  direction  of  designing  an 
automatic  CAD  system  for  characterization  of  temporal  cases  on  malignant  and  benign.  This  is 
very  novel  and  unique  CAD  system  which  includes  an  automatic  registration  of  the  corresponding 
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current  and  prior  clusters  and  then  classifying  them  by  a  temporal  classifier  on  malignant  and 
benign  (Figure  1).  For  this  purpose  the  feature  extraction  and  classification  were  earried  out  with 
the  clusters  and  individual  microcalcifications  obtained  automatically  from  the  registration  stage 
[1,  6,  9].  The  design  of  an  automatic  CAD  also  presents  ehallenges.  The  automatically  detected 
eluster  locations  on  the  temporal  pairs  may  deviate  from  the  optimal  locations  as  selected  by  expert 
radiologists.  This  will  introduce  “noise”  to  the  extracted  features  and  make  the  classification  task 
more  difficult.  In  addition  for  some  cases  there  is  no  detection  on  prior  mammogram  and  only 
current  information  can  be  used. 


Stage  1 


Stage  2 


Figure  1.  Bloek-diagram  of  the  registration  -  elassifieation  CAD  for  temporal  microealcification 
clusters. 


Feature  extraction  and  definition  of  difference  features 

In  this  study,  a  new  classification  scheme  using  interval  change  information  was  developed 
to  elassify  mammographic  mieroealcification  clusters  as  malignant  and  benign  [9]  (Figure  2). 

From  eaeh  automatically  detected  cluster,  20  run  length  statistic  texture  features  (RLSF) 
and  21  morphological  (Mo)  features  were  extracted  [7].  Additionally,  78  SOLD  [2]  and  64  GLDS 
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[3,  4]  texture  features  were  also  extraeted.  All  texture  features  (RLSF,  SOLD,  and  GLDS)  were 
extraeted  from  automatieally  detected  cluster  locations.  The  morphological  features  were  extracted 
from  the  automatically  detected  microcalcifications  within  the  automatically  detected  cluster 
locations. 


Likelihood  of  malignancy 


Figure  2.  Classification  of  temporal  pairs  of  microcalcification  clusters  on  malignant  and 
benign  combining  temporal  classifier  for  the  TP-TP  and  TP-FP  pairs  and  current  classifier  for 
the  cases  without  detection  on  prior. 

Twenty  difference  RLSF  were  obtained  by  subtracting  a  prior  RLSF  from  the 
corresponding  current  RLSF.  We  have  designed  a  new  feature,  the  ratio  feature,  which  is  defined 
as  the  ratio  between  current  and  prior  feature  [5,  9].  We  have  obtained  21  Mo  ratio  features.  In 
addition  we  used  current  GLDS  features. 


Table  1 .  The  feature  vector  used  for  the  temporal  classifier. 


Generated  feature  type 

Ratio  features 

Difference  features 

Current  features 

Features 

Mo 

RLSF 

GLDS 

Number  of  features 

21 

20 

64 

The  feature  space  consisted  of  the  Mo  ratio  features,  the  difference  RLSF,  as  well  as  the 
current  GLDS  features  (Table  1). 
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We  are  eontinuing  to  investigate  the  possibilities  to  define  and  extract  additional  sets  of 
features  in  order  to  improve  the  classification  accuracy. 


Figure  3.  Temporal  classifier  for  classification  of  temporal  pairs  of  microcalcification  clusters 
on  malignant  and  benign. 


Classifier  design 

When  we  are  designing  the  classification  system  for  classification  of  microcalcification 
clusters  on  malignant  and  benign  we  have  to  consider  the  fact  that  the  microcalcification  detection 
algorithm  may  not  be  able  to  detect  all  clusters  on  prior.  If  the  cluster  is  very  subtle  the  automatic 
microcalcification  detection  for  the  identification  of  mirocalcification  clusters  may  not  be  able  to 
detect  it.  In  this  case  two  types  of  classifiers  are  designed  (Figure  2).  A  temporal  classifier  (Figure 
3)  based  on  current  and  prior  information  is  used  if  a  cluster  is  detected  in  the  prior,  and  a  current 
classifier  (Figure  6)  based  on  current  information  alone  is  used  if  no  prior  cluster  is  detected.  The 
temporal  classifier  is  trained  to  classify  the  true  (TP-TP)  and  false  (TP-FP)  pairs.  The  automatically 
detected  cluster  locations  on  the  temporal  pairs  may  deviate  from  the  optimal  locations  as  selected 
by  expert  radiologists.  This  will  introduce  “noise”  to  the  extracted  features  and  make  the 
classification  task  more  difficult. 
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In  the  past  year  of  the  projeet  we  have  eoneentrated  our  efforts  on  using  different  types  of 
elassifieation  methods  in  the  temporal  elassifier  in  order  to  elassify  the  temporal  pair  on  malignant 
and  benign.  We  have  used  a  linear  diseriminant  analysis  elassifier  (LDA),  support  veetor  maehine 
elassifier  (SVM)  [8]  and  a  multilayer  pereeptron  neural  network  with  baekpropagation  training 
(NN).  A  stepwise  feature  seleetion  with  floating  window  of  3  were  used  to  seleet  the  most  useful 
feature  subsets  and  to  merge  the  feature  into  a  diseriminant  seore  (Figure  3).  A  leave-one-ease-out 
resampling  seheme  was  used  for  feature  seleetion  and  to  train  and  to  test  the  elassifiers. 

In  addition  a  eurrent  elassifier  was  trained  using  the  eurrent  images  from  the  temporal  pairs 
(the  eases  that  has  no  deteetion  on  prior)  and  tested  on  the  eases  that  have  not  deteetion  on  prior 
(Figure  6).  LDA  classifier  was  used  in  this  task.  The  classification  accuracy  was  analyzed  by 
receiver  operating  characteristic  (ROC)  methodology. 

In  this  study,  175  serial  pairs  containing  biopsy-proven  calcification  clusters  were  used.  On 
the  priors,  the  radiologist  rated  12  of  the  175  clusters  as  not  visible  and  the  subtlety  of  18  clusters 
as  9  or  10  on  a  scale  of  10. 

At  the  first  stage  of  the  system  (Figure  1),  85%  (149/175)  of  the  TP-TP  pairs  were 
identified  with  15  false  matches  within  the  164  image  pairs  that  had  computer  detected  clusters 
on  the  priors.  Therefore  164  temporal  image  pairs  (149  TP-TP  and  15  TP-FP),  of  which  49  were 
malignant,  were  used  for  classification  as  malignant  and  benign.  At  the  second  stage  (Figure  1), 
the  SVM,  LDA  and  NN  temporal  classifiers  were  used  for  the  classification  of  the  164  cluster 
temporal  pairs. 

For  each  of  LDA  and  SVM  (both  radial  and  dot  kernel)  temporal  classifiers  a  feature 
selection  was  performed.  Feature  selection  for  NN  is  a  very  competitively  intensive  process  and 
because  of  this  for  the  NN  temporal  classifier  we  used  the  best  feature  set  obtained  for  the  LDA 
temporal  classifier. 

LDA  temporal  classifier 

When  an  LDA  temporal  classifier  was  used  the  best  result  was  obtained  for  an  average  of 
7  features  selected  (Table  2).  The  selected  features  included  1  difference  RLS  feature,  4 
morphological  ratio  features  and  2  GLDS  features  from  the  current  image.  All  the  features  were 
consistently  selected  for  all  the  training  partitions.  The  LDA  temporal  classifier  achieved  test  Az 
of  0.83±0.03  (Table  4).  The  Az  results  for  the  LDA  temporal  classifier  with  different  number  of 
selected  features  is  presented  in  Figure  4. 
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Table  2.  Selected  features  for  the  combined  classifier  for  classification  on  malignant  and  benign  of 
automatically  detected  clusters  in  current  and  prior  mammograms. _ _ 


Classifier 

Feature  type 

Ratio  features 

Difference  features 

Current  features 

Temporal  classifier 
LDA,NN 

Mo 

4 

RLSF 

1 

GLDS 

2 

Temporal  classifier 
SVM  (dot  kernel) 

Mo 

3 

RLSF 

1 

GLDS 

3 

Temporal  classifier 
SVM  (radial  kernel) 

Mo 

3 

RLSF 

1 

GLDS 

3 

SVM  temporal  classifier 

SVMs  with  dot  kernel,  radial  kernel  and  tanh  kernel  (neural  kernel)  [8]  were  studied  [10]. 
The  training  and  testing  results  are  presented  in  Table  3. 

The  SVM  temporal  classifier  with  dot  kernel  achieved  training  Az  of  0.87±0.03  and  test 
Az  of  0.82±0.03.  The  selected  features  for  the  dot  kernel  included  1  difference  RLS  feature,  3 
morphological  ratio  features  and  3  GLDS  features  from  the  current  image.  The  performance  of 
the  SVM  with  dot  kernel  for  the  different  number  of  selected  features  is  presented  in  Figure  4. 
For  the  small  number  of  features  the  performance  of  the  SVM  with  dot  kernel  was  close  to  LDA. 
However  for  the  larger  number  of  features  the  LDA  slightly  over-performed  the  SVM  with  dot 
kernel. 

The  SVM  temporal  classifier  with  radial  kernel  achieved  training  Az  of  0.86±0.03  and 
test  Az  of  0.81±0.03  (Table  3).  For  this  kernel  the  selected  features  were  also  1  difference  RLS 
feature,  3  morphological  ratio  features  and  3  GLDS  features  from  the  current  image  (Table  2). 
The  Az  values  of  the  SVM  temporal  classifier  with  radial  kernel  were  consistently  smaller  than 
that  for  the  SVM  with  dot  kernel  for  the  different  number  of  features.  (Figure  4). 

The  SVM  temporal  classifier  with  tanh  kernel  (neural  kernel)  achieved  training  Az  of 
0.86±0.03  and  test  Az  of  0.82±0.03  (Table  3).  However,  it  required  83  support  vectors  and  it  was 
slower  to  train.  Because  of  that  we  did  not  perform  feature  selection  used  the  best  feature  set  we 
had  from  the  LDA  classifier. 

The  SVM  with  dot  kernel  performed  the  best  compared  to  the  SVMs  with  other  kernels. 
It  also  required  the  smallest  number  of  support  vectors. 
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NN  temporal  classifier 

A  NN  with  7  input  nodes,  different  number  of  neurons  in  the  hidden  layer,  and  1  output 
node  was  used.  The  NN  aehieved  the  best  result  (Az  =  0.84±0.03)  for  8  hidden  nodes  (Figure  5). 
The  performanee  of  the  NN  temporal  classifier  as  a  function  of  the  number  of  the  hidden  neurons 
is  presented  in  Figure  5.  For  most  of  the  NN  structures  the  test  Az  was  0.83  or  lower.  For  larger 
number  of  hidden  neurons  we  have  observed  slight  overtraining.  For  8  hidden  nodes  there  was  a 
pick  in  Az,  which  corresponded  to  the  best  result. 

Table  4  summarizes  the  results  of  using  different  classification  methods  for  the  temporal 
classifier.  The  difference  in  Az  between  any  one  of  the  classifiers  did  not  achieve  statistical 
significance.  LDA  classifier  revealed  good  stable  performance  with  very  simple  structure  of  the 
classifier.  This  was  the  reason  to  be  selected  for  the  observer  study  experiment. 

In  comparison,  the  MQSA  radiologist  achieved  an  Az  of  0.72±0.04  for  the  164  temporal 

pairs. 


2  3  4  5  6  7 

Number  of  Features 


Figure  4.  Test  Az  results  for  SVM  (radial  and  dot  kernels)  and  LDA  with  different  number  of 
selected  features. 
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The  difference  RLS  texture  features,  morphological  ratio  features  and  GLDS  current 
features  were  useful  for  identification  of  malignancy  in  temporal  pairs  of  mammograms.  The 
information  on  the  prior  image  improved  characterization  of  the  microcalcification  clusters:  5  out 
of  the  7  selected  features  contained  prior  information  for  the  LDA  and  NN;  4  out  of  the  7  selected 
features  contained  prior  information  for  the  SVM. 


2  4  6  8  10  12  14  16 


Number  of  Hidden  Neurons 

Figure  5.  Test  results  for  NN  with  different  number  of  neurons  in  the  hidden  layer. 


Table  3.  SVM  classification  results  based  on  7  selected  features. 


SVM  Type 

Training  Az 

Test  Az 

Number 

SV 

Dot  Kernel 

0.87±0.03 

0.82±0.03 

78 

Radial  Kernel 

0.86±0.03 

0.81±0.03 

89 

Tanh  Kernel 

0.86±0.03 

0.82±0.03 

83 
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Table  4.  Overall  results  for  the  LDA,  SVM,  NN  classifers. 


Training  Az 

Test  Az 

LDA  Classifier  based  on  temporal  pairs 

0.87±0.03 

0.83±0.03 

SVM  Classifier  based  on  temporal  pairs 

0.87±0.03 

0.82±0.03 

NN  Classifier  based  on  temporal  pairs 

0.87±0.03 

0.84±0.03 

LDA  Classifier  based  on  current  images  alone 

0.82±0.03 

0.76±0.04 

MQSA  radiologist 

- 

0.72±0.04 

For  the  eurrent  elassifier  that  was  elassifying  1 1  elusters  without  detection  on  the  prior, 
an  average  of  6  features  from  the  current  images  were  selected  (Table  5).  The  selected  features 
included  1  current  RLS  feature,  3  current  morphological  features  and  2  GLDS  current  features. 
All  the  features  were  consistently  selected  for  all  the  training  partitions.  The  test  Az  by  the 
current  classifier  was  0.72±0.14  for  classifying  the  clusters  as  malignant  or  benign. 


Detected  clusters 
on  current 
mammograms 


Likelihood  of 
malignancy 


Figure  6.  Current  classifier  based  on  current  information  alone.  It  is  used  in  the  case  if  no  prior 
cluster  is  detected. 
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Table  5.  Selected  features  for  the  combined  classifier  for  classification  on  malignant  and  benign  of 
automatically  detected  clusters  in  current  and  prior  mammograms. _ _ 


Classifier 

Feature  type 

Ratio  features 

Difference  features 

Current  features 

Current  classifier 

Mo 

3 

RLSF 

1 

OLDS 

2 

In  the  future  year  we  will  continue  to  develop,  improve  and  compare  classification 
approaches  in  order  to  classify  the  clusters  on  malignant  and  benign  based  on  automatically 
detected  clusters  and  individual  microcalcifications. 


(C)  Compare  the  classification  accuracy  of  the  classification  scheme  using  temporal  change 
information  with  that  of  a  classifier  using  single-exam  information  alone.  (Task  7), 

We  have  performed  a  comparison  between  the  temporal  classifier  and  classifier  based  on 
current  images  (Table  6).  The  test  Az  for  the  classifier  based  on  164  current  images  was 
0.76±0.04.  We  see  an  improvement  for  the  temporal  classifier  (Az=0.83±0.03)  when  compared 
to  the  current  classifier.  We  will  continue  to  perform  comparisons  for  the  different  design  of 
both  the  temporal  pair  classifiers  and  the  current  classifiers. 


Table  6.  Results  for  the  combined  classifier  for  classification  on  malignant  and  benign  of 
automatically  detected  clusters  in  current  and  prior  mammograms. _ 


Classifier  Type 

Temporal  classifier 

Current  classifier 

No.  of  pairs  (or  only  current  images) 

164 

11 

Number  of  selected  features 

7 

6 

Az 

0.83±0.03 

0.76±0.04 

(D)  Conduct  observer  performance  study  to  compare  radiologists’  classification  of 
malignant  and  benign  microcalcifications  with  and  without  the  aid  of  the  temporal  change 
classifier.  (Task  8). 

We  started  performing  a  pilot  study  as  a  first  step  to  design  an  observer  performance 
experiment  with  ROC  methodology  to  evaluate  the  effects  of  computer  classification  on 
radiologists’  estimates  of  the  likelihood  of  malignancy  of  temporal  pairs  of  microcalcification 
clusters  [11,  12].  A  graphical  user  interface  was  developed  on  a  PC  to  display  side-by-side  the 
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temporal  pairs  of  masses  in  a  predesigned  random  order  for  eaeh  observer.  The  likelihood  of 
malignaney  and  the  BI-RADS  assessment  of  the  radiologist  on  each  pair  is  automatically 
recorded  when  they  select  it  on  a  slider. 

175  pairs  of  serial  mammograms  from  65  patients  containing  clustered 
microcalcifications  (51  malignant  and  124  benign)  were  chosen  from  patient  files.  The  true 
cluster  locations  were  identified  by  an  MQSA  radiologist  on  all  current  mammograms  and  164  of 
the  priors.  Regions  of  interest  containing  the  corresponding  clusters  were  extracted  from  the 
current  and  prior  mammograms  of  each  pair  and  analyzed  by  the  CAD  system.  All  cases 
eventually  underwent  biopsy  so  that  interval  change  was  observed  for  most  of  the  clusters  even  if 
they  were  found  to  be  benign  after  biopsy.  This  was  therefore  a  difficult  data  set  for  interval 
change  analysis. 

Four  MQSA  radiologists,  different  from  the  one  who  identified  the  clusters,  assessed  the 
temporal  pairs  that  were  displayed  side-by-side  on  a  workstation.  The  cases  were  read  in  a 
counter-balanced  design.  The  readers  provided  estimates  of  the  LM  and  BI-RADS  assessment 
without  and  then  with  CAD.  The  classification  accuracy  was  quantified  by  the  area  under  ROC 
curve,  Az. 

The  CAD  system  alone  achieved  a  test  Az  value  of  0.83  for  this  data  set.  For  the  four 
radiologists,  the  average  Az  in  estimating  the  LM  was  0.70  (range:  0.64  -  0.75)  without  CAD  and 
improved  to  0.77  (range:  0.68  -  0.83)  with  CAD.  The  improvement  was  statistically  significant 
(p=0.04).  The  ROC  study  is  ongoing  to  include  additional  radiologists  as  observers. 

This  pilot  study  indicates  that  CAD  using  interval  change  analysis  can  significantly  improve 
radiologists’  accuracy  in  classification  of  clustered  microcalcifications  on  serial  mammograms  and 
it  may  be  useful  for  assisting  radiologists  in  classification  of  masses  and  thereby  reducing 
unnecessary  biopsies. 

This  pilot  study  will  be  the  basis  for  our  design  of  a  full-scale  ROC  study,  were  the 
additional  radiologists  will  provide  estimates  of  the  likelihood  of  malignancy  and  BI-RADS 
assessment  without  and  with  CAD.  We  are  in  process  of  recruiting  additional  breast  radiologists 
at  our  department  to  participate  as  observers.  The  sample  size  is  acceptable  but  we  are  continuing 
to  enlarge  the  data  set  until  the  ROC  study  design  is  finalized.  We  also  are  working  on  the  further 
improvement  of  the  accuracy  of  the  interval  change  classifier.  We  expect  that  this  ROC  study  can 
be  completed  within  the  no  cost  time  extension  year  approved  for  this  grant.  This  type  of 
observer  study  is  new  and  unique  and  the  outcome  will  be  important,  providing  a  new 
understanding  of  the  potentials  of  computer  aid  to  the  radiologists  in  characterization  of  the 
temporal  changes  of  mammographic  masses. 
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(6)  Key  research  accomplishments  in  current  year  as  a  result  of  this  grant 


Increase  of  the  temporal  mieroealeifioation  database  (eollection  of  new  temporal  cases  and 
extraetion  of  regions  of  interest)  (Task  1). 

Development  of  LDA,  SVM  and  NN  elassifiers  for  elassifioation  on  malignant  and  benign 
interval  elusters  based  on  automatieally  deteeted  elusters  and  individual  mieroealeifioations 
(Task  6). 

Comparison  of  temporal  elassifier  and  elassifier  based  on  current  mieroealeifieation  elusters 
(Task  7). 

Performing  a  pilot  observer  study  with  the  radiologist  evaluating  temporal  pairs  of 
mieroclacifioation  elusters  without  CAD  (Task  8). 


(7)  Reportable  Outcomes 

Publications  in  current  year  as  a  result  of  this  grant 


[1]  Hadjiiski  LM,  HP  Chan,  B  Sahiner,  MA  Helvie,  MA  Roubidoux,  C  Zhou,  “Automated 
regional  registration  and  elassifioation  of  oorresponding  mieroealeifioation  elusters  on  serial 
mammograms,”  97^*  Scientific  Assembly  and  Annual  Meeting  of  the  Radiological  Society  of 
North  America,  Chioago,  IL,  Nov.  27-Deo  2,  2005.  pp.  270. 

[2]  Hadjiiski  L,  D  Drouillard,  HP  Chan,  B  Sahiner,  MA  Helvie,  MA  Roubidoux,  C  Zhou, 
“Characterization  of  oorresponding  mieroealeifioation  elusters  on  temporal  pairs  of 
mammograms  for  interval  ohange  analysis:  oomparison  of  elassifiers”  Proc.  SPIE  Medical 
Imaging,  2006,  6144:  5Q1-5Q6. 

[3]  Hadjiiski  L,  HP  Chan,  B  Sahiner,  MA  Roubidoux,  MA  Helvie,  CParamagul,  AV  Nees,  J 
Bailey,  SK  Patterson,  “ROC  Study:  Effeots  of  Computer-Aided  Diagnosis  on  Radiologists’ 
Characterization  of  Malignant  and  Benign  Breast  Clustered  Miorooaloifications  in  Temporal 
Pairs  of  Mammograms,”  To  be  presented  at  the  92nd  Scientific  Assembly  and  Annual 
Meeting  of  the  Radiological  Society  of  North  America,  Chioago,  IL.  Nov.  26  -  Deo.  1,  2006. 

[4]  Hadjiiski  L,  HP  Chan,  B  Sahiner,  AV  Nees,  J  Bailey,  SK  Patterson,  MA  Roubidoux,  MA 
Helvie,  C  Paramagul,  “Computer-Aided  Charaoterization  of  Malignant  and  Benign  Breast 
Clustered  Mieroealeifioations  on  Serial  Mammograms:  Early  Experience  of  its  Effeots  on 
Radiologists’  Performanoe  from  an  ROC  Study,”  Eduoation  exhibit  at  the  92nd  Scientific 
Assembly  and  Annual  Meeting  of  the  Radiological  Society  of  North  America,  Chicago,  IL. 
Nov.  26 -Dec.  1,2006. 
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Copies  of  publications  are  enclosed  with  this  report. 
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(8)  Conclusion 


As  a  result  of  the  support  by  the  USAMRMC  grant,  in  the  fourth  year  of  this  project,  we 
have  (1)  collected  additional  cases  with  temporal  microcalcification  clusters;  (2)  Extracted  texture 
and  morphological  features  from  matching  corresponding  microcalcification  clusters  on  current 
and  prior  mammograms  from  automatically  detected  microcalcification  clusters;  (3)  Develop 
feature  measures  and  temporal  classifier  for  characterization  of  temporal  changes  in  automatically 
detected  microcalcification  clusters;  (4)  Compare  the  performance  of  EDA,  SVM  andNN  temporal 
classifiers  (5)  Compare  the  temporal  classifier  with  classifier  based  on  current  images  only;  (6) 
Perform  a  pilot  observer  performance  study. 

The  results  obtained  so  far  are  encouraging.  The  new  classification  scheme,  using  interval 
change  information,  to  classify  mammographic  microcalcification  clusters  as  malignant  and  benign 
showed  promising  results.  Morphological  ratio  features,  difference  RES  features  and  current  GLDS 
features  were  useful  for  the  classification. 

We  have  made  a  major  progress  in  the  direction  of  the  ultimate  goal  of  the  project  -  to  have 
an  automatic  CAD  system  for  characterization  of  temporal  cases  on  malignant  and  benign.  Eor  that 
purpose  a  feature  extraction  and  classification  was  carried  out  with  the  clusters  and  individual 
microcalcifications  obtained  automatically  from  the  registration  stage.  Based  on  these  features  we 
have  designed  a  classification  scheme  to  classify  the  automatically  registered  and  detected 
microcalcification  clusters.  This  temporal  classifier  performed  better  than  the  classifier  based  on 
current  images  only. 

We  performed  a  pilot  observer  study  for  evaluation  of  the  radiologist  performance  without 
and  with  CAD.  Eour  radiologists  participated  in  the  study.  The  radiologists  improved  their 
performance  when  using  CAD.  The  improvement  was  statistically  significant. 

The  design  of  an  automatic  CAD  can  also  present  challenges.  The  automatically  detected 
cluster  locations  on  the  temporal  pairs  may  deviate  from  the  optimal  locations  as  selected  by  expert 
radiologists.  This  will  introduce  “noise”  to  the  extracted  features  and  make  the  classification  task 
more  difficult.  In  the  feature  year  we  will  continue  to  improve  all  the  stages  of  the  automatic 
detection  and  classification  system  with  the  aim  to  improve  the  classification  results  for 
microcalcification  clusters  obtained  automatically  from  the  registration  stage. 
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from  sensitivifr-  and  specificin-  were  determined 

from  all  3  types  of  mteipretations.  Paired  t-tests  were  performed  to 
determw  the  significance  of  differences  in  the  average  scStivities  and 
specificities  without  and  with  CAD 

■■  birads  and  CR.  it  was  not  possible  for  e^-er^'  obser^■er 

^  ^  agreement  with  the 

actual  PMD.  Howe\er,  use  of  user-dependent  cutoffs  resulted  in  better 
agreement  than  use  of  constant  cutoff.  Without  the  use  of  CAD  the 
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“^dependent  cu^^TTere 
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r^vw'n/?.*^  a\erage  sensih^te/specificity  pairs  were  0.93/0.67,  0.92/0.77 
1.00/0.04  and  0.95/036,  ncspeclivclv.  ' 

CONCH/S/ONS;  For  many  users,  different  forms  of  radiologists'  interpre- 
patient  management  decisions.  Di^ent  for^of 
rXn  may  also  result  in  different  conclusions  about 

^  ability  of  CAD  to  improve  the  radiologist  performance.  Finally,  when 
dctemmmg  patient  management  from  confidence  ratings  user-deiiendenl 
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Compirter-aWod  Diagnosis  Scheme  for  Identifying  Histological  Classifica¬ 
tion  of  Clustered  Microcalcifications  Using  Follow-up  Magnification  Mam- 
mo^rams 
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mam^^i.11^’  clustered  microcalcification  on 

mammogranw  be  difficult,  and  thus  often  requires  follow-up.  The 
purjfx.se  of  this  study  was  to  develop  a  computer-aided  diagnosis  ^eme 
fonotTu^l  classification  of  clmfered  micioca^fications  on 

mammograms  in  order  to  assist  radioiogusts' 
mterpretation  as  a  second  opinion."  “ 

METHOD  AND  I^TERIALS:  Our  database  consisted  of  1 86  magnification 
mammoj^am-s  wluch  included  current  and  pres-ious  images  obt^i^  from 
M  pa  i^ts  after  three  montiis  foUow-up  It  included  1 1  invasive  carcino¬ 
ids,  19  nonmvasive  carcinomas  of  comedo  t>T>e,  25  noninvasive  carcino- 
23  mastopafhies  and  15  fibroadenomas.  The 
histological  classifications  of  all  clustered  microcalcifications  were  proved 
by  pathol^c  diagnosis  after  three  months  follow-up.  The  clustered 

By  use  of  a  filler  hank  and  a 
hresholding  technique  Five  ol^ective  features  on  clustered  microcalcifica- 

of  current  and  preefious  images  bv  taking 
accent  subjective  features  that  experienc-^  radiologists  co^onl? 

possible  histological  classifications.  BavS^  discriminant 
^ction  WM  employed  for  distinction  behveen  five  histological  classifica- 
For  the  mput  of  Bayes  discriminant  function,  we  us^  five  objective 

and  also  ten  objective  teatuies 
oblauied  from  both  current  and  prec-ious  images 

classification  accuracj-  ranged  from 
tv  .T",  ^^'.B"‘;''as  unproved  substantially  by  use  of  previous  images. 

ten  objective  fc^res,  the  classification  accuracies  of  our  computer¬ 
ized  scheme  were  100%<I1/1I)  for  invasive  carcinoma,  100%(19/19)  for 
comedo  h-pe,  96.0%(24/25)  for  noninvasive 

os^srr  fil=l° 

^NCLUSIONS:  By  use  of  both  current  and  previous  images,  this 
^mputenzed  scheme  achieved  high  classification  accuracies,  ai^  would 

n  assessment  of  clustered  microcalcifi- 

calions.  (S.K,  K.D.  are  shareholders  in  R2  Technology-.) 

SSC17-06  •  11:20  AM 

Automated  Regional  Registration  and  Claasmcatlon  of  Corresponding 
Microcalclficatlon  Clusters  on  Serial  Mammograms 

•  fl  Sahinef.  PbO*  •  M  A 

X  ^Bou.  PbO*  dt^iskOomlch  adu) 

PU^Of£_  To  deve  op  an  automated  system  for  detecting  corresponding 
imerecalcification  clusters  on  serial  mammograms,  and  classifying  the 

mterx  aV!Smge  inform^^  * 

^THOD  AND  AdATERMf.5.  Our  system  consists  of  two  stages.  In  the 
first  stage,  based  on  the  location  of  a  detected  cluster  on  t^  current 
mai^ogr^i,  a  regional  registration  procedure  identifies  the  local  .area  on 

corresponding  cluster.  A  search  program  is 
used  to  detect  cluster  candidates  within  the  local  area.  The  clusteron  the 
ts  then  paired  with  the  candidates  to  form  true  (TP-TP)  or 
«m«ipondence  cUssifier  using  automatically  ex- 
Bn  the  second  stage,  a 

temporal  classitier  based  on  current  and  prior  information  is  used  if  a 
cluster  IS  detected  in  the  prior,  and  a  current  classifier  based  on  current 

270 


infoi^hon  alone  is  used  if  no  prior  cluster  is  detected.  In  this  study  175 
H  Bi^y-proven  calcification  clusters  were  used'.  An 

''^®”BBed  the  corresponding  clusters  on  the  mammo¬ 
grams.  On  the  pnors,  the  radiologist  rated  12  of  the  175  clusters  as  not 
visible  and  the-  subtlety-  of  18  dusters  as  9  or  10  on  T^ate  of  V 
^tior"™"  '^**^“*  resampling  w  as  used  for  feature  selection  and  cla.ssifi- 

RESULTS:  The  .^rch  program  detected  90.2%  (158/175)  of  the  clusters  on 
^  ‘'•■th  an  average  of  0.43  FPs/image.  The  corresj^^re  i 

(1-I9/175)  of  the  TP-TP  pfirs  with  15  falsnilS  ’ 
dr^rected  clusters.  In  the  classificafio^ 
stage  foe  temporal  classifier  achiey-ed  a  test  Az  of  0A3  for  the  164  pairs  for 
efr^-mg  the  dusters  .ys  malignant  or  benign.  For  the  11  clusters  witlx^t 
detection  on  the  p^r,  the  test  Az  by  the  current  classifier  was  0:^T 

and 

BUerval  change  anal>sis  sy  stem  can  detect  the 
co^^i^mg  d^r  on  the  prior  mammogram  with  high  sensitivitv,  and 
^ify  them  with  an  accuracy-  compansble  to  that  an  ex^rienced  ra'diolo- 

SSC17-07  •  11:30  AM 

Pr^ot^  Aialysis  a  Clinical  Breast  Ultrasound  CAD  Workstation 

HGn^sk^*.  ,L  .  K.  Orukkor.  PfiO*  .  c.A.  Sanneff.  MD*  .  L  Lan 

MS  M  L.  Gtgar.  PhD  •  /.  Bonta.  MD*  (hgirusz/Ooic.edb) 
w-ofk^H™^"  ey-aluate  the  impad  of  a  proioty-pe  breast  ultrasound  CAD 
'L  ;'«''‘‘flow-  in  a  clinical  environment,  including  its 
4  X  equipment  and  its  use  by  radiologUts. 

oretoK?^  '^^iP  We  designed  a  breast  ultrasound  CAD 

preto^  pe  wo^stahon  to  be  utilized  in  a  clinical  setting  based  on  previous 
"'ith  retrospeclivc  image  analysis  using  CAD  methodology.  In 
Its  curiCTI  implementation,  the  workstation  is  configured  as  an  a^ve 
‘’“iV  II“"‘  “Brasound  scanner,  an  HDl  5000  by  Philips 
DICOM  automatic  image  transfer  meth^  usi^^ 

“^Sution  ^  to  transfer  images  directly  from  the  scanner  to  fof 
worksution.  There,  the  images  are  automatically  prepryxessed  and  all 

^fr^  Th^H^i  “^”";;"‘*on  Ls  remoyed  using^in-house  dey-eloped 
^fiw  are.  The  radiol^ist  then  performs  lesion  identification  on  each  im^ 
usmg  our  customized  graphical  user  interface.  For  the  purposes  of 
w^kfk,w-  study,  cad  analysis  results  were  not  displayed^fo  order  to 
pw  ent  influmre  of  patient  care.  Usmg  our  profoly-pe  workstation,  images 

Participatefo  thus  study  -  w-fue 
wa^Ti^-r^  .'’Tf  By  a  single  radiologist.  The  radiologist 

w^ow  ^  determme  the  workstation's  overall  impact  on  dinfoal 

®BIe  to  completely  interoperate 
equipment.  The  amount  oFume 
'workstation  varied  substan- 
^IV  Howey  er,  the  radiologist  reported  that  the  work.siatlon  added  imlv 

lur^^d”l'orkTp*^ 

CC^LUS/^S;  The  use  of  our  prototype  yvorkstation  has  no  significant 
Tt^unri  •<*  perform  a  standard  diagnosliTbrcASI 

and  on  the  clmical  ivorkfloyv-  in  general.  Our  design 
into  a  standard  clinical  environment  due  toS 
iT  sBindard  DICOM  protocols.  (M.L.G.:  M.  U  Giger  is  a 

^areholder  in  M  Technology,  Inc  (Sunny-y-ale,  CA).  It  is  the  policv^of  the 
ted C^cago  that  investigators  disdose  publicly  acti!2l  or'  poten- 

Sti«T"“ 

SSC17-08  •  11:40  AM 

Volumetric  BreaM  Density  EstimaMon  from  Full  Field  Digital  Mammograms 
N  fCarssern^,  PhD  .  N^magen,  Nelherlanas  •  S.  van  Engaiand*  •  C.  Soefes, 

•  W.  Hitfsman.  PhD*  •  P.  Snoersn,  PhD* 

density-  is  an  important  risk  factor  for  breast  cancer.  The 

vol^ofri^r’^.^  ®  accurate  estimation  of 

breast^ue  from  full  field  digital  mammograms  (FFDM) 

1  results  by  correlation  w  ith  volumes  of  denw  breast  tissue 

determined  from  breast  MRI  ussue 

METHOD  AND  MATERIALS:  Unptoce.s.sed  FFDM  images  aUow-  conver¬ 
sion  of  pixel  valu^  to  the  log  exposure  domain.  Bv  using  a  physical  model 
of  tmage  acquisiti^  this  allows  estimation  of  the  proportbn  of  dense 

‘"Bagration,  thevolume  of  dense 
physical  model  we  use  is  based  on  the 
i  ^  '*  "imposed  of  two  ty-pes  of  tissue,  fatty-  and 

de^.  Efrective  hnear  attenuation  coefficients  of  these  tissues  yvere  delved 
from  empirical  data  as  a  function  of  three  parameters:  Tube  voltage  (kVo) 

D'}ro\rm  i,"!?  “*?fion.  These  parameters  are  available  ^m 
DICOM  image  headers.  A  simple  relation  is  derived  that  aUoyy-s  compufa- 
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ABSTRACT 

We  are  developing  an  automated  system  for  analysis  of  microcalcification  clusters  on  serial  mammograms.  Our 
automated  system  consists  of  two  stages:  (1)  automatic  registration  of  corresponding  clusters  on  temporal  pairs  of 
mammograms  producing  true  (TP-TP)  and  false  (TP-FP)  pairs;  and  (2)  characterization  of  temporal  pairs  of  clusters  as 
malignant  and  benign  using  a  temporal  classifier.  In  this  study,  we  focussed  on  the  design  of  the  temporal  classifier. 
Morphological  and  texture  (RLS  and  GLDS)  features  are  automatically  extracted  from  the  detected  current  and  prior 
cluster  locations.  Additionally,  difference  morphological  and  RLS  features  are  obtained.  The  automatically  detected 
cluster  locations  on  the  temporal  pairs  may  deviate  from  the  optimal  locations  as  selected  by  expert  radiologists.  This 
will  introduce  “noise”  to  the  extracted  features  and  make  the  classification  task  more  difficult.  Linear  discriminant 
analysis  (LDA)  and  support  vector  machine  (SVM)  classifiers  were  trained  to  classify  the  true  and  false  pairs.  Leave- 
one-case-out  resampling  method  was  used  for  feature  selection  and  classifier  design.  In  this  study,  175  serial 
mammogram  pairs  containing  biopsy-proven  microcalcification  clusters  were  used.  At  the  first  stage  of  the  system,  85% 
(149/175)  of  the  TP-TP  pairs  were  identified  with  15  false  matches  within  the  164  image  pairs  that  had  computer- 
detected  clusters  on  the  priors.  At  the  second  stage,  an  average  of  7  features  were  selected  (4  difference  morphological,  1 
difference  RLS  and  2  current  GLDS).  The  LDA  and  SVM  temporal  classifiers  achieved  test  Az  of  0.83  and  0.82, 
respectively,  for  the  classification  of  the  164  cluster  temporal  pairs  as  malignant  or  benign.  In  comparison,  an  MQSA 
radiologist  achieved  an  Az  of  0.72.  Both  the  LDA  and  SVM  classifiers  were  able  to  classify  the  automatically  detected 
temporal  pairs  of  microcalcification  clusters  with  accuracy  comparable  to  that  of  an  experienced  radiologist. 


Keywords:  Computer-Aided  Diagnosis,  Interval  Changes,  Microcalcification  Classification,  Feature  analysis. 
Mammography,  Malignancy. 

1.  INTRODUCTION 

Mammography  is  currently  the  most  effective  method  for  early  breast  cancer  detection*’^.  Radiologists 
routinely  compare  mammograms  from  a  current  examination  with  those  obtained  in  previous  years,  if  available,  for 
identifying  interval  changes,  detecting  potential  abnormalities,  and  evaluating  breast  lesions.  It  is  widely  accepted  that 
analysis  of  interval  changes  in  mammographic  features  is  very  useful  for  both  detection  and  classification  of 
abnormalities^’"^.  A  variety  of  computer-aided  diagnosis  (CAD)  techniques  have  been  developed  to  detect 
mammographic  abnormalities  and  to  distinguish  between  malignant  and  benign  lesions.  We  are  studying  the  use  of 
CAD  techniques  to  assist  radiologists  in  interval  change  analysis. 

Commonly  used  classification  methods  for  CAD  use  information  from  a  single  examination.  These  methods 
have  been  shown  to  perform  well  in  lesion  classification  problems^’ However,  when  multiple-year  mammograms  of  a 
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lesion  are  available,  it  is  not  trivial  to  design  computer  vision  methods  to  use  the  temporal  information  for  computer- 
aided  classification  and  to  improve  the  differentiation  between  benign  and  malignant  masses. 

The  goal  of  our  research  is  to  develop  a  technique  for  computerized  analysis  of  temporal  differences  between  a 
microcalcification  cluster  on  the  most  recent  mammogram  and  a  prior  mammogram  of  the  same  view*^.  The  computer 
system  can  be  used  to  assist  radiologists  in  evaluating  interval  changes  and  thus  distinguishing  between  malignant  and 
benign  microcalcification  clusters  for  CAD.  It  will  also  be  useful  for  improving  the  identification  of  new  or  growing 
clusters  in  a  detection  system  or  for  improved  classification  of  malignant  and  benign  clusters  in  a  diagnostic  system.  In 
our  previous  studies  we  have  demonstrated  that  interval  change  analysis  can  improve  differentiation  of  malignant  and 
benign  masses*®’*^. 

The  purpose  of  this  study  is  to  compare  the  performance  of  different  classifiers  for  the  task  of  automated 
characterization  of  microcalcification  clusters  using  interval  change  information  on  serial  mammograms.  With  an 
automated  system,  noise  is  introduced  by  the  imperfect  registration  and  imperfect  detection  of  the  clusters  on  both 
current  and  prior  mammograms.  This  analysis  will  be  useful  for  understanding  the  advantages  and  limitations  of  the 
classifiers  when  performing  in  noisy  conditions. 

2.  MATERIALS  AND  METHODS 

Our  automated  system  consists  of  two  stages:  (1)  registration  of  corresponding  clusters  on  temporal  pairs  of 
mammograms,  and  (2)  characterization  of  the  temporal  pairs  of  clusters  as  malignant  and  benign  (Fig.  1).  A  description 
of  the  detection  method  and  the  database  used  is  given  below. 


Stage  1 


Stage  2 


Figure  1.  Block-diagram  of  the  registration  -  classification  CAD  for  temporal  microcalcification  clusters. 
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2.1  Registration  of  corresponding  microcalcification  clnsters  in  serial  mammograms 

In  the  first  stage,  based  on  the  location  of  a  detected  cluster  on  the  current  mammogram,  a  regional  registration 
procedure  identifies  the  local  area  on  the  prior  that  may  contain  the  corresponding  cluster.  An  automated  detection 
program  is  used  to  detect  cluster  candidates  within  the  local  area  (Fig.  2)  that  may  include  true  positives  (TP)  and  false 
positives  (FPs).  The  cluster  on  the  current  image  is  then  paired  with  the  candidates  to  form  true  (TP-TP)  or  false  (TP- 
FP)  pairs  (Fig.  2).  A  correspondence  classifier  is  used  to  reduce  the  false  pairs  (TP-FP).  This  technique  was  presented  in 
detail  previously*^. 


Current 

Figure  2.  Automated  registration  resulting  in  trae  (TP-TP)  and  false  (TP-FP)  pairs  by  pairing  the  cluster  on  the  current  image  with  the 
detected  TP  and  FP  candidates  on  prior  image. 


2.2  Classification  of  automatically  detected  and  registered  temporal  pairs  of  microcalcification  clusters 

In  the  second  stage,  a  temporal  classifier  based  on  current  and  prior  information  is  used  if  a  cluster  is  detected 
on  the  prior,  and  a  current  classifier  based  on  current  information  alone  is  used  if  no  prior  cluster  is  detected.  In  this 
study,  we  focussed  on  the  design  of  the  temporal  classifier  (Fig.  3).  Morphological  and  texture  features  including  run 
length  statistics  (RLS)  and  gray  level  dependance  statistics  (GLDS)  are  automatically  extracted  from  the  detected 
current  and  prior  cluster  locations.  Sixty  four  GLDS  features,  20  morphological  features  and  20  RLS  features  and  the 
cluster  size  were  extracted.  Additionally,  20  difference  morphological  and  20  difference  RLS  features  are  obtained  by 
subtracting  a  prior  feature  from  the  corresponding  current  feature.  The  automatically  detected  cluster  locations  on  the 
temporal  pairs  may  deviate  from  the  optimal  locations  as  selected  by  expert  radiologists.  This  will  introduce  “noise”  to 
the  extracted  features  and  make  the  classification  task  more  difficult.  Linear  discriminant  analysis  (LDA)  and  support 
vector  machine*^  (SVM)  classifiers  were  trained  to  classify  the  true  and  false  pairs. 
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pairs 

pairs 


Figure  3.  Block-diagram  of  the  temporal  classifier  for  classification  of  malignant  and  benign  microcalcification  clusters.  Both  TP-TP 
and  TP-FP  pairs  can  be  input  to  the  system. 


2.3  Data  set 

In  this  study,  175  serial  mammogram  pairs  containing  biopsy-proven  microcalcification  clusters  were  used,  of 
which  51  were  biopsy-proven  to  be  malignant  and  124  benign.  An  experienced  MQSA  radiologist  identified  the  gold 
standard  cluster  locations  on  corresponding  mammogram  pairs.  On  the  priors,  the  radiologist  rated  12  of  the  175 
clusters  as  not  visible  and  the  subtlety  of  18  clusters  as  9  or  10  on  a  scale  of  10  (l=most  obvious,  10=subtlest). 

The  mammograms  were  digitized  with  a  LUMISCAN  85  laser  scanner  at  a  pixel  resolution  of  50//m  X  50//m 
and  4096  gray  levels.  The  image  matrix  size  was  reduced  by  averaging  every  2x2  adjacent  pixels  and  down-sampled  by 
a  factor  of  2  to  obtain  images  with  a  pixel  size  of  \QQjjm  X  \QQjjm  for  analysis  by  the  computer. 

2.4  ROC  analysis 

Leave-one-case-out  resampling  method  was  used  for  feature  selection  and  classifier  design.  Stepwise  feature 
selection  was  used  to  obtain  the  best  feature  set.  To  evaluate  the  classifier  performance,  the  training  and  test 
discriminant  scores  were  analyzed  using  receiver  operating  characteristic  (ROC)  methodology^*’.  The  discriminant 
scores  of  the  malignant  and  benign  microcalcification  clusters  were  used  as  decision  variables  in  the  LABROCl 
program^’,  which  fits  a  binormal  ROC  curve  based  on  maximum  likelihood  estimation.  The  classification  accuracy  was 
evaluated  as  the  area  under  the  ROC  curve, 


3.  RESULTS 

At  the  first  stage  of  the  system,  85%  (149/175)  of  the  TP-TP  pairs  were  identified  with  15  false  matches  within 
the  164  image  pairs  that  had  computer  detected  clusters  on  the  priors.  Therefore  164  temporal  image  pairs  (149  TP-TP 
and  15  TP-FP),  of  which  49  were  malignant,  were  used  for  classification  as  malignant  and  benign.  At  the  second  stage, 
an  average  of  7  features  were  selected.  The  selected  features  included  4  difference  morphological  features,  1  difference 
RLS  feature  and  2  GLDS  features  from  the  current  image.  The  SVM  and  LDA  temporal  classifiers  were  used  for  the 
classification  of  the  164  cluster  temporal  pairs.  SVMs  with  dot  kernel,  radial  kernel  and  tanh  kernel  (neural  kernel)  were 
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studied.  The  training  and  testing  results  are  presented  in  Table  1.  The  SVM  classifier  achieved  test  of  0.82+0.03. 
The  LDA  classifier  achieved  test  A^  of  0.83+0.03.  In  comparison,  the  MQSA  radiologist  achieved  an  A^  of  0.72+0.04 
for  the  164  temporal  pairs.  The  difference  in  A^  between  any  one  of  the  classifiers  and  the  radiologist  did  not  achieve 
statistical  significance. 


Table  1.  SVM  classification  results  based  on  7  selected  features. 


SVM  Type 

Training  Az 

Test  Az 

Number  SV 

Dot  Kernel 

0.87+0.03 

0.82+0.03 

78 

Radial  Kernel 

0.86+0.03 

0.81+0.03 

89 

Tanh  Kernel 

0.86+0.03 

0.82+0.03 

83 

4.  CONCLUSION 

We  studied  the  use  of  the  LDA  and  SVM  classifiers  for  the  classification  of  automatically  detected  temporal 
pairs  of  microcalcification  clusters  as  malignant  or  benign.  Both  the  LDA  and  SVM  classifiers  were  able  to  classify  the 
automatically  detected  temporal  pairs  of  microcalcification  clusters  with  an  accuracy  comparable  to  that  of  an 
experienced  radiologist.  The  SVMs  with  dot  kernel,  radial  kernel,  and  tanh  kernel  had  comparable  performance  with  the 
radial  kernel  being  slightly  worse  than  the  other  two.  The  LDA  and  SVM  classifiers  also  had  comparable  performance. 
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PURPOSE 

To  evaluate  the  effects  of  computer-aided  diagnosis  (CAD)  on  radiologists'  characterization  of 
clustered  microcalcifications  on  serial  mammograms. 
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METHOD  AND  MATERIALS 

We  developed  an  automated  CAD  system  to  register  and  classify  microcalcification  clusters  as 
malignant  or  benign  based  on  interval  change  information  on  serial  mammograms.  In  this  study, 
we  conducted  observer  performance  experiments  with  receiver  operating  characteristic  (ROC) 
methodology  to  evaluate  the  effects  of  CAD  on  radiologists'  estimates  of  the  likelihood  of 
malignancy  (LM)  of  the  clusters.  175  pairs  of  serial  mammograms  from  65  patients  containing 
clustered  microcalcifications  (51  malignant  and  124  benign)  were  chosen  from  patient  files  with 
IRB  approval.  The  true  cluster  locations  were  identified  by  an  MQSA  radiologist  on  all  current 
mammograms  and  164  of  the  priors.  Regions  of  interest  containing  the  corresponding  clusters 
were  extracted  from  the  current  and  prior  mammograms  of  each  pair  and  analyzed  by  the  CAD 
system.  All  cases  eventually  underwent  biopsy  so  that  interval  change  was  observed  for  most  of 
the  clusters  even  if  they  were  found  to  be  benign  after  biopsy.  This  was  therefore  a  difficult  data 
set  for  interval  change  analysis.  Four  MQSA  radiologists,  different  from  the  one  who  identified  the 
clusters,  assessed  the  temporal  pairs  that  were  displayed  side-by-side  on  a  workstation.  The 
cases  were  read  in  a  counter-balanced  design.  The  readers  provided  estimates  of  the  LM  and  BI¬ 
RADS  assessment  without  and  then  with  CAD.  The  classification  accuracy  was  quantified  by  the 
area  under  ROC  curve,  Az. 


RESULTS 

The  CAD  system  alone  achieved  a  test  Az  value  of  0.83  for  this  data  set.  For  the  four 
radiologists,  the  average  Az  in  estimating  the  LM  was  0.70  (range:  0.64-0.75)  without  CAD  and 
improved  to  0.77  (range:  0.68-0.83)  with  CAD.  The  improvement  was  statistically  significant 
(p=0.04).  The  ROC  study  is  ongoing  to  include  additional  radiologists  as  observers. 


CONCLUSION 

CAD  using  interval  change  analysis  can  significantly  improve  radiologists'  accuracy  in 
classification  of  clustered  microcalcifications  on  serial  mammograms. 


CLINICAL  RELEVANCE/APPLICATION 

CAD  using  interval  change  analysis  may  increase  the  positive  predictive  value  of  mammography. 
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PURPOSE/AIM 

(1)  To  understand  the  design  of  an  observer  performance  study  for  characterization  of 
microcalcification  clusters  on  serial  mammograms  without  and  with  CAD.  (2)  To  understand  the 
effects  of  CAD  on  radiologists'  assessment  of  the  likelihood  of  malignancy  (LM)  of  clustered 
microcaicifications. 
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CONTENT  ORGANIZATION 

We  will  describe  our  CAD  system  that  performs  automated  registration  of  corresponding  clusters 
on  serial  mammograms  and  classifies  malignant  and  benign  clusters  based  on  interval  change 
information.  We  will  describe  the  design  of  our  multi-reader  multi-case  ROC  study,  and  discuss 
the  effects  of  CAD  on  radiologists'  estimates  of  the  likelihood  of  malignancy  of  microcaicifications 
using  ROC  analysis  and  on  radiologists'  biopsy  recommendation  decision  by  analysis  of  their  BI¬ 
RADS  assessment. 


SUMMARY 

CAD  using  interval  change  analysis  may  be  useful  for  assisting  radiologists  in  classification  of 
microcalcification  clusters  and  thereby  reducing  benign  biopsies.  This  exhibit  will  lead  to 
understanding  of  (1)  Design  of  a  CAD  system  using  interval  change  analysis.  (2)  Design  of  an 
observer  performance  study  for  evaluation  of  CAD  using  receiver  operating  characteristic  (ROC) 
methodology.  (3)  Interaction  of  observer  with  a  CAD  system  (4)  Effects  of  CAD  on  radiologists' 
decision. 
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